Picture for Bei Li

Bei Li

LANG: Reinforcement Learning for Multilingual Reasoning with Language-Adaptive Hint Guidance

Add code
May 21, 2026
Viaarxiv icon

MTR-Suite: A Framework for Evaluating and Synthesizing Conversational Retrieval Benchmarks

Add code
May 20, 2026
Viaarxiv icon

Teacher-Guided Policy Optimization for LLM Distillation

Add code
May 13, 2026
Viaarxiv icon

RouteLMT: Learned Sample Routing for Hybrid LLM Translation Deployment

Add code
Apr 24, 2026
Viaarxiv icon

MemoSight: Unifying Context Compression and Multi Token Prediction for Reasoning Acceleration

Add code
Apr 16, 2026
Viaarxiv icon

MSRL: Scaling Generative Multimodal Reward Modeling via Multi-Stage Reinforcement Learning

Add code
Mar 26, 2026
Viaarxiv icon

On the Emotion Understanding of Synthesized Speech

Add code
Mar 17, 2026
Viaarxiv icon

SpanNorm: Reconciling Training Stability and Performance in Deep Transformers

Add code
Jan 30, 2026
Viaarxiv icon

Causal Autoregressive Diffusion Language Model

Add code
Jan 29, 2026
Viaarxiv icon

LongCat-Flash-Thinking-2601 Technical Report

Add code
Jan 23, 2026
Viaarxiv icon